Figure 1: Threading motifs and their occurrences.
Table 1: Lossy compression of gray-scale images (1 pixel = 1 byte).



file      file len   GZip len          Codec2            Codec1            %Diff    %loss    '•'/char
                     [%compr]          [%compr]          [%compr]          GZip
bridge       66336   61657 [7.05]      60987 [8.06]      57655 [13.08]      6.49     0.42    1/4
                                       60987 [8.06]      50656 [23.63]     17.84    14.29    1/3
camera       66336   48750 [26.51]     47842 [27.88]     46192 [30.36]      5.25     0.74    1/6
                                       48044 [27.57]     45882 [30.83]      5.88     2.17    1/5
                                       47316 [28.67]     43096 [35.03]     11.60     9.09    1/4
lena        262944   234543 [12.10]    226844 [13.73]    210786 [19.83]    10.13     4.17    1/4
                                       186359 [29.13]    175126 [33.39]    25.33    20.00    1/3
peppers     262944   232334 [11.64]    218175 [17.03]    199605 [23.8?]    14.09     6.25    1/4
                                       180783 [31.25]    173561 [33.99]    25.30    20.00    1/3
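
The bracketed percentages and the %Diff column can be reproduced from the byte counts. The following is a minimal sketch, assuming that [%compr] is the saving relative to the original file length and that %Diff GZip is the saving of Codec1 relative to the GZip output. Both formulas are inferred from the tabulated values rather than stated in the source, and the function names are illustrative.

```python
def percent_compression(original_len: int, compressed_len: int) -> float:
    """Percentage of the original length saved by compression (assumed formula)."""
    return 100.0 * (1.0 - compressed_len / original_len)


def percent_diff_vs_gzip(gzip_len: int, codec_len: int) -> float:
    """Percentage by which the codec output is smaller than the GZip output (assumed formula)."""
    return 100.0 * (gzip_len - codec_len) / gzip_len


# First camera row of Table 1: file len 66336, GZip len 48750, Codec1 46192.
camera_gzip = percent_compression(66336, 48750)
camera_codec1 = percent_compression(66336, 46192)
camera_diff = percent_diff_vs_gzip(48750, 46192)
print(f"{camera_gzip:.4f} {camera_codec1:.4f} {camera_diff:.4f}")
```

For this row the computed values agree with the tabulated 26.51, 30.36 and 5.25 to within 0.01.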



Table 2: Lossy compression of binary images. 



file      file len   GZip len          Codec2            Codec1            %Diff    %loss    '•'/char
                     [%compr]          [%compr]          [%compr]          GZip
ccitt7      513229   109612 [78.64]    98076 [80.89]     91399 [82.19]     16.62    16.67    1/5
                                       93055 [81.87]     90873 [82.29]     17.10    16.67    1/4
                                       92658 [81.95]     85391 [83.36]     22.10    25.00    1/3
test4       279213   58736 [78.96]     57995 [79.23]     54651 [80.42]      6.95     0.91    1/4
                                       57714 [79.32]     54402 [80.51]      7.38     1.27    1/3



Table 3: Lossy compression of music (1 sample = 1 byte). 



file      file len   GZip len          Codec2            Codec1            %Diff    %loss    '•'/char
                     [%compr]          [%compr]          [%compr]          GZip
crowd       128900   103834 [19.44]    92283 [28.41]     86340 [33.01]     16.85    16.67    1/3
eclipse     196834   171846 [12.96]    148880 [24.36]    139308 [29.22]    18.93     9.09    1/4
                                       114709 [41.72]    111058 [43.57]    35.37    25.00    1/3



Table 4: Lossy vs. lossless performance. 



file      file dim   GZip              Codec1            %loss    '•'/char   Lossless          %Diff
                     [%compr]          [%compr]                              [%compr]          GZip
bridge       66336   61657 [7.05]      50656 [23.63]     14.29    1/3        59344 [10.54]      3.75
camera       66336   48750 [26.51]     43096 [35.03]      9.09    1/4        45756 [31.02]      6.14
lena        262944   234543 [12.10]    175126 [33.39]    20.00    1/3        199635 [24.07]    14.88
peppers     262944   232334 [11.64]    199605 [23.86]     6.25    1/4        211497 [19.56]     8.97
                                       173561 [33.99]    20.00    1/3        195744 [25.55]    15.75
ccitt7      513229   109612 [78.64]    90873 [82.29]     16.67    1/4        97757 [80.09]     10.82
                                       85391 [83.36]     25.00    1/3        89305 [82.59]     18.53
test4       279213   58736 [78.96]     54402 [80.51]      1.27    1/3        54875 [80.84]      6.57
crowd       128900   103834 [19.44]    86340 [33.01]     16.67    1/3        96903 [24.82]      6.68
eclipse     196834   171846 [12.96]    139308 [29.22]     9.09    1/4        159206 [19.11]     7.36
                                       111058 [43.57]    25.00    1/3        151584 [22.98]    11.97




Table 5: Lossless compression of Calgary Corpus.

file      file len   GZip              Codec1            %loss    '•'/char   Lossless          %Diff
                     [%compr]          [%compr]                              [%compr]          GZip
bib         111261   35063 [68.49]     ? [67.35]          ?       1/6        ? [66.30]          ?
book1       768771   ? [60.01]         ? [68.01]          ?       1/6        ? [63.95]         11.00
book2       610856   206687 [66.16]    ? [67.72]          ?       ?          ? [66.81]          ?
geo         102400   68493 [33.11]     40027 [60.91]     16.67    1/4        63662 [37.63]      7.05
news        377109   144840 [61.59]    144541 [61.67]     0.42    1/3        144644 [61.64]     0.14
obj1         21504   10323 [51.99]     8386 [61.00]      16.67    2/5        9221 [57.12]      10.68
obj2        246814   81631 [66.93]     71123 [71.18]     20.00    1/2        83035 [66.36]     -1.72
paper1       53161   18577 [65.06]     19924 [62.52]      1.75    1/3        20174 [62.05]     -8.60
paper2       82199   29753 [63.80]     29920 [63.60]      0.76    1/2        30219 [63.24]     -1.57
pic         513216   56422 [89.01]     52229 [89.82]      0.56    1/3        52401 [89.79]      7.13
progc        39611   13275 [66.49]     13840 [65.06]      1.32    1/2        14140 [64.30]     -6.52
progl        71646   16273 [77.29]     17249 [75.92]      0.58    1/3        17355 [75.78]     -6.65
progp        49379   11246 [77.23]     12285 [75.12]      0.64    1/3        12427 [74.83]    -10.50



Table 6: Lossless compression of sequences from DNA yeast families.

file          file len   GZip              Codec1            %loss    '•'/char   Lossless          %Diff
                         [%compr]          [%compr]                              [%compr]          GZip
Spor EarlyII     25008   8008 [67.98]      6990 [72.05]       0.45    1/3        7052 [71.80]      11.94
Spor EarlyI      31039   9862 [68.23]      8845 [71.50]       0.36    1/3        8914 [71.28]       9.61
Helden CGN       32871   10379 [68.43]     8582 [73.89]       1.33    1/3        8828 [73.14]      14.94
Spor Middle      54325   16395 [69.82]     14839 [72.68]      0.36    1/4        14924 [72.53]      8.97
Helden All      112507   33829 [69.93]     29471 [73.81]      1.56    1/4        29862 [73.46]     11.73
Spor All        222453   68136 [69.37]     56323 [74.68]      1.61    1/3        57155 [74.31]     16.12
All Up 400k     399615   115023 [71.22]    93336 [76.64]     14.29    1/3        106909 [73.25]     7.05




Table 7: Synopsis of compression rates for sequences in the yeast DNA by various lossless
methods. The figure in square brackets is the percentage gain of Codec1 versus other methods.

                         Huffman          LZ-78            LZ-77            BWT              Codec1
File          File Len   Pack             Compress         GZip             BZip             Lossless
                         [%diff]          [%diff]          [%diff]          [%diff]
Spor EarlyII     25008   7996 [13.4]      7875 [11.7]      8008 [13.6]      7300 [3.5]       7052
Spor EarlyI      31039   9937 [11.5]      9646 [8.2]       9862 [10.6]      9045 [1.5]       8914
Helden CGN       32871   10590 [20.0]     10223 [15.8]     10379 [17.6]     9530 [8.0]       8828
Spor Middle      54325   17295 [15.9]     16395 [9.9]      16395 [9.9]      15490 [3.8]      14924
Helden All      112507   36172 [21.1]     33440 [12.0]     33829 [13.3]     31793 [6.5]      29862
Spor All        222453   70755 [23.8]     63939 [11.9]     68136 [19.2]     61674 [7.9]      57156
All Up 400k     399615   121700 [13.8]    115029 [7.6]     115023 [7.6]     112363 [5.1]     106909
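
The bracketed gains in Table 7 appear to be computed relative to the Codec1 lossless size, not relative to the competing method as the %Diff GZip column of the earlier tables seems to be. The following is a minimal sketch under that assumption; the formula is inferred from the numbers, and `percent_gain` is an illustrative name.

```python
def percent_gain(other_len: int, codec_len: int) -> float:
    """Extra bytes used by the other method, as a percentage of the codec output (assumed formula)."""
    return 100.0 * (other_len - codec_len) / codec_len


# Spor EarlyII row of Table 7.
other_methods = {"Pack": 7996, "Compress": 7875, "GZip": 8008, "BZip": 7300}
codec1_lossless = 7052

gains = {
    name: round(percent_gain(size, codec1_lossless), 1)
    for name, size in other_methods.items()
}
print(gains)
```

Rounded to one decimal, these values match the bracketed entries for that row (13.4, 11.7, 13.6 and 3.5).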




Table 8: Compression, fidelity and loss in reconstruction of grey scale images. 

















File      File len   GZip len          Codec1            Diff%    %Loss    '•'/char   %Loss per     %Loss per
                     [%compr]          [%compr]          GZip                         pixel over    pixel over
                                                                                      recon pix     all pix
bridge       66336   61657 [7.05]      57655 [13.08]      6.49     0.42    1/4        5.67          0.02
                                       60987 [8.06]      17.84    14.29    1/3        7.69          0.90
camera       66336   48750 [26.51]     47316 [28.67]     11.60     9.09    1/4        0.78          0.05
lena        262944   234543 [12.10]    210786 [19.83]    10.13     4.17    1/4        7.26          0.27
                                       186359 [29.13]    25.33    20.00    1/3        5.11          0.81
peppers     262944   232334 [11.64]    199605 [23.85]    14.09     6.25    1/4        1.53          0.08
                                       180783 [31.25]    25.30    20.00    1/3        3.29          0.52



Identify a plurality of irredundant patterns in a data set
    ↓
Extract at least a portion of the plurality of irredundant patterns from the data set to generate a compressed data set
    ↓
End



Figure 
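
The two steps of the flowchart can be illustrated with a deliberately simplified sketch. It does not implement irredundant-pattern discovery: as a stand-in, it assumes the "pattern" is the most frequent fixed-length substring, and it "extracts" that pattern by replacing each occurrence with a one-character marker. The function names, the pattern length and the marker character are all hypothetical choices made for illustration.

```python
from collections import Counter
from typing import Optional, Tuple

MARKER = "\x00"  # hypothetical placeholder symbol; must not occur in the input


def identify_pattern(data: str, length: int) -> Optional[str]:
    """Step 1 (simplified): return the most frequent substring of the given length."""
    counts = Counter(data[i:i + length] for i in range(len(data) - length + 1))
    if not counts:
        return None
    pattern, count = counts.most_common(1)[0]
    return pattern if count >= 2 else None  # a pattern must repeat to be useful


def extract_pattern(data: str, length: int = 3) -> Tuple[str, Optional[str]]:
    """Step 2 (simplified): replace occurrences of the pattern by the marker."""
    if MARKER in data:
        raise ValueError("input already contains the marker symbol")
    pattern = identify_pattern(data, length)
    if pattern is None:
        return data, None
    return data.replace(pattern, MARKER), pattern


def restore(compressed: str, pattern: Optional[str]) -> str:
    """Inverse of extract_pattern: put the pattern back wherever the marker appears."""
    return compressed if pattern is None else compressed.replace(MARKER, pattern)


original = "abcabcabcxyz"
compressed, pattern = extract_pattern(original)
assert restore(compressed, pattern) == original
print(len(original), len(compressed), repr(pattern))
```

The compressed form is the shortened string together with the stored pattern. A practical scheme would repeat the two steps with further patterns and would also have to account for the cost of storing them.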



DESCRIPTION OF FARMER OAK - AN INCIDENT When
Farmer Oak smile., the corners .f his mouth spread till the. were
within an unimportant distance .f his ears, his eye. were reduced to
chinks, and ...erging wrinkle...red round them, extending upon ...
countenance like the rays in a rudimentary sketch of the rising sun. His
Christian name was Gabriel, and on working days he was a young
man of sound judgment, easy motions, proper dress, and ...eral good
character. On Sundays, he was a man of misty views rather given to
postponing, and .ampered by his best clothes and umbrella : upon ...
whole, one who felt himself to occupy morally that ... middle space
of Laodicean neutrality which ... between the Communion people of
the parish and the drunken section, - that ... he went to church, but
yawned privately by the t.me the cong.egation reached the Nicene
creed,- and thought of what there would be for dinner when he meant
to be listening to the sermon.



DESCRIPTION OF FARMER OAK - AN INCIDENT When
Farmer Oak smiled, the corners of his mouth spread till they were
within an unimportant distance of his ears, his eyes were reduced to
chinks, and diverging wrinkles appeared round them, extending upon
his countenance like the rays in a rudimentary sketch of the rising
sun. His Christian name was Gabriel, and on working days he was a
young man of sound judgment, easy motions, proper dress, and general
good character. On Sundays he was a man of misty views, rather
given to postponing, and hampered by his best clothes and umbrella :
upon the whole, one who felt himself to occupy morally that vast middle
space of Laodicean neutrality which lay between the Communion
people of the parish and the drunken section, - that is, he went to
church, but yawned privately by the time the congregation reached the
Nicene creed,- and thought of what there would be for dinner when
he meant to be listening to the sermon.



